A Multithreaded Message Passing Interface (MPI) Architecture: Performance and Program Issues

نویسندگان

  • Boris V. Protopopov
  • Anthony Skjellum
چکیده

Device Interface (ADI ) Channel Device Interface Low-level device MPI Implementation MPI Application Communication Device Figure 1. MPICH layered software architecture The deficiencies of the MPICH architecture, such as inefficient multi-fabric communication and non-thread-safety are rooted in the ADI and Device layers. In order to make further discussion specific, we present the ADI and CDI in more detail below. 2.1 Abstract Device Interface (ADI) Message passing in MPI is organized in the following way. When users call MPI_Send or MPI_Recv, a data structure that contains an instantaneous description of the data transfer operation is created (we refer to it as transfer description later in the paper). The data structure is reference by an opaque pointer called handle. The handles to transfers are placed in send and receive queues that are used in order to ensure the correct order of transfer completion. The

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Open Issues in MPI Implementation

MPI (the Message Passing Interface) continues to be the dominant programming model for parallel machines of all sizes, from small Linux clusters to the largest parallel supercomputers such as IBM Blue Gene/L and Cray XT3. Although the MPI standard was released more than 10 years ago and a number of implementations of MPI are available from both vendors and research groups, MPI implementations s...

متن کامل

MPI for Windows NT: Further Study of the Message Passing Interface for Clusters and SMP Environments

Mississippi State University, together with Argonne National Laboratory, developed the most widely used implementation of MPI called MPICH. MPICH emphasizes a Unix environment. This paper continues the study of implementing MPI for clusters of Windows NT workstations. Performance issues pertaining to the architecture of Intel x86 based workstations , Windows NT operating system architecture , a...

متن کامل

Parallel Homologous Search with Hirschberg Algorithm: A Hybrid MPI-Pthreads Solution

In this paper, we apply two different parallel programming model, the message passing model using Message Passing Interface (MPI) and the multithreaded model using Pthreads, to protein sequence homologous search. The protein sequence homologous search uses Hirschberg algorithm for the pair-wise sequence alignment. The performance of the homologous search using the MPI-Pthread is compared to the...

متن کامل

Efficient MPI Support for Advanced Hybrid Programming Models

The number of multithreaded Message Passing Interface (MPI) implementations and applications is increasing rapidly. We discuss how multithreaded applications can receive messages of unknown size. As is well known, combining MPI Probe/MPI Recv is not threadsafe, but many assume that trivial workarounds exist. We discuss those workarounds and show how they fail in practice by either limiting the ...

متن کامل

Overlapping Communication and Computation in MPI by Multithreading

Since the emergence of MPI (Message Passing Interface), how to improve its performance has always been a goal for MPI library implementation. This paper proposes an efficient architecture to achieve this goal from a very low level: using multi-threaded model to implement MPI pointto-point operations in order to overlap communication and computation. Compared to single-threaded active polling co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Parallel Distrib. Comput.

دوره 61  شماره 

صفحات  -

تاریخ انتشار 2001